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Abstract 

We consider quantum channels with two senders and one receiver. For an arbitrary such 
cliannel, we give multi-letter characterizations of two different two-dimensional capacity re- 
gions. The first region is comprised of the rates at which it is possible for one sender to send 
classical information, while the other sends quantum information. The second region consists 
of the rates at which each sender can send quantum information. For each region, we give an 
example of a channel for which the corresponding region has a single-letter description. One 
of our examples relies on a new result proved here, perhaps of independent interest, stating 
that the coherent information over any degradable channel is concave in the input density 
operator. We conclude with connections to other work and a discussion on generalizations 
where each user simultaneously sends classical and quantum information. 



1 Introduction 

A classical multiple access channel with two senders and one receiver is described by a probability 
transition matrix p{z\x,y). For the situation in which each sender wishes to send independent 
information, Ahlswede PP and Liao [2] showed that the capacity region C admits a single-letter 
characterization, given by the convex hull of the closure of the set of rate pairs (r, s) satisfying 

r < I{X-Z\Y) 
s < I{Y;Z\X) 
r + s < I{XY;Z) 

for some p{x)p{y). Further analysis by Cover, El Gamal and Salehi gives single-letter charac- 
terizations of a set of correlated sources which can be reliably transmitted over a multiple access 



1 



channel, generalizing the above, as well as Slepian-Wolf source coding and cooperative multiple 
access channel capacity. They also give a multi-letter expression for the capacity region, showing 
that an i.i.d. source [U, V) can be reliably transmitted if and only if 



H{U\V) 



< -I{X";Z"\U'^Y'') 
n 

< i/(y";Z"|l^"X") 
n 

< -I{X"Y";Z'^) 



H{V\U) 



H{U,V) 



for some n and where x" refers to the sequence of symbols (xi, . . . ,a;„). A 

similar convention has been used for sequences of jointly distributed random variables, as X" = 
{Xi, . . . , Xn). Such a characterization is of limited practical use, however, as it does not apparently 
lead to a finite computation for deciding if a source can be transmitted. 

In quantum Shannon theory, various capacities of a single quantum channel are not currently 
known to be computable in general. It is known |1] that the capacity of an arbitrary quantum 
channel for transmitting quantum information cannot be expressed as a single-letter optimization 
problem. Furthermore, the classical capacity of a quantum channel is only known to be additive 
in this sense when the encoder is restricted to preparing product states. 

Winter |S] has shown that the capacity region of a multiple access channel with classical inputs 
and a quantum output for the transmission of independent classical messages admits a single 
letter characterization which is identical in form to that of C. Results on the classical capacity 
region of quantum binary adder channels are contained in OC]. In what follows, we will examine 
the capacity region of an arbitrary quantum multiple access channel with quantum inputs and a 
quantum output, used in two distinct ways for the transmission of uncorrclatcd information from 
each terminal. Our first result describes the capacity region for the case in which one user sends 
quantum information, and the other classical. The second result characterizes the capacity region 
for the scenario in which each user wishes to send only quantum information. 

The paper is organized as follows. Section|21contains the relevant background material necessary 
to state and prove our main results. This includes mention of the notational conventions we will 
use throughout the paper, definitions of the distance measures for states we will use, as well 
as definitions of the information quantities which will characterize our rate regions. We also 
introduce two of the three equivalent information processing tasks that will be considered in this 
paper, entanglement transmission and entanglement generation. Section |31 contains statements 
of Theorems 1 and 2, the main results of this paper. We collect various relationships between 
our distance measures, a number of lemmas, and statements of existing coding propositions in 
Section 0] which also contains the proofs of Theorems 1 and 2. In Section [S] a third information 
processing scenario, strong subspace transmission, will be introduced. All three scenarios will be 
proved equivalent in that section as well. Section|Slrelates results contained in this paper to existing 
and future results. For each of the main theorems, the appendix gives an example of a quantum 
multiple access channel for which the associated description is single- letter, in the sense that it has 
a characterization in terms of an optimization of single-letter information quantities fSections 17.11 
and l7.2|l . Proofs of a new concavity result for degradable channels (Section [^J, of the convexity 
of our capacity regions fSection l7.4|l . and of the sufficiency of the bound on the cardinality of the 
set of classical message states for cq protocols (Section 17. 5|) appear there as well. 

2 Background 

A typical quantum system will be labeled A. Its Hilbert space will be Ha- The dimension of 
Ha will be abbreviated as \A\ = dimTi^. For convenience, the label A will often be shorthand 



2 



for some collection of operators on T-La when the context makes this apparent. For example, a 
density matrix p & A refers to a normalized, positive operator p: Ha Ha- We will often 
abbreviate this by writing p"^ to remind the reader of the system to which p belongs. Saying that 
Af: A ^ B is a. channel will really mean that A/": B{Ha) B{Hb) is completely-positive and 
trace-preserving. Two systems A and B may be combined with a tensor product, resulting in the 
system AB, where Hab = Ha ® He- The system A" has a Hilbcrt space Ha^ = 'H'a", and the 
various operator algebras described by A" will be appropriate subsets of S(7i®"). We will freely 
identify J\f = Af ^ 1^ , where C is any other system, in order to simplify long expressions. This 
procedure will always result in a unique completely positive map, since every channel in this paper 
will be completely positive. The maximally mixed state on a Hilbert space Ha will always be 
written as tt^ = 'PT"'^"^' ^^"^ reserve the symbol |<I>) for bipartite states which are maximally 
entangled. An exception to this convention will be made when, given a density matrix p"^ , we 
write |$p)^"^ for a purification of p^ . When we write the density matrix of a pure state we 
will freely make the abbreviation tp = 

We will use the following conventions for distance measures between states. If p and a are 
density matrices, we will write 

F{'7,p)= (^Tr^VW^) 

for (the squared version of) the fidelity [Sj. It is not hard to check that F is symmetric. For two 
pure states, this reduces to 

F(|0),|V')) = |(</>|V)P, 
while for a pure state and a mixed state, 

Fi\<P),p)^{4>\p\(f>) ^Tr(j>p. 

In this last case, we may interpret the fidelity as the success probability for a measurement which 
tests for the presence of the pure state 0, when a physical system with density matrix p is presented. 
Indeed, for a POVM {</-, 1 - 0}, 

Pr{measure 0| prepared p} ~ Tv (f>p — F{(j), p). 

The trace norm of an operator A g B{H) is defined as the sum of its singular values, and can 
be expressed as 

\A\i = Tr\/IU. 

This gives rise to another useful distance measure on states, the trace distance, defined as the trace 
norm of the difference between the states. It can be written explicitly as 

|cr - p\i = Trv/(cr-p)2, 

and carries a normalization which assigns a distance of 2 to states with orthogonal support. 

In order to introduce the information quantities which will be used to characterize our capacity 
regions, we first introduce the concept of a classical-quantum (cq) density matrix or state. Let X 
be a finite set and let X be an A"- valued random variable, distributed according to p{x). We can 
define a Hilbert space Hx with a fixed orthonormal basis {\x)-^}x£X, labeled by elements of the 
set X. This sets up an identification |-)'^ : X Hx between the elements of X and that particular 
basis. By this correspondence, the probability distribution p{x) can be mapped to a density 
matrix p = J2xexPi^)\^)(-''\ which is diagonal in the basis {|a:)}a;gAr- Further, to every subset 
Sex corresponds a projection matrix Us = J2xgs\-'^)(^\ which commutes with p. This way, 
we can express concepts from classical probability theory in the language of quantum probability, 
such as the equivalence Pt{X € S} = Tr plls- From the early development of quantum mechanics, 
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noncommutativity has been seen to be the hallmark of quantum behavior. It is to be expected 
that classical probability, embedded in quantum theory's framework, is described entirely with 
commuting matrices. 

Consider now a collection of density matrices {o'x}xex' i^idexed by the finite set X. If those 
states occur according to the probability distribution p(a;). we may speak of an ensemble {p{x), cr^} 
of quantum states. In order to treat classical and quantum probabilities in the same framework, a 
joint density matrix can be constructed 

This is known as a cq state, and describes the classical and quantum aspects of the ensemble on the 
extended Hilbert space Hx ^ Ha [HI ■ The semiclassical nature of the ensemble is reflected in the 
embedding of a direct sum of Hilbert spaces ^^^^ '^^^ into Hx ® Ha- This should be compared 
with the purely classical case, where a direct sum of one-dimensional vector spaces ^^^^ ^ 
embedded into Hx ■ Just as our classical density matrix p was diagonal in a basis corresponding to 
elements of X, the cq density matrix a is block- diagonal, where the diagonal block corresponding 
to X contains the non- normalized density matrix p{x)<7x. The classical state is recoverable as 
p = Tr^ (T, while the average quantum state is Trx cr = "^x&x '^x- We will further speak of cqq 
states, which consist of two quantum parts and one classical. When even more systems are involved, 
we will defer to the terminology cq to mean that some subsystems are classical, while some are 
quantum. Such states are not only of interest in their own right; information quantities evaluated 
on cq states play an important role in characterizing what is possible in quantum information 
theory. Now, let a be some cqq state, in block-diagonal form 

X 

We write 

H{A)„ = (cr^) = - Tr (T^ log cr^ 

for the von Neumann entropy of the density matrix associated with A, where ct^ = Tr^x cr. 
H[AB)a is defined analogously. We will omit subscripts when the state under consideration is 
apparent. The mutual information is defined as 

I{X; B) = H{X) + H{B) - H{XB). 

Depending on the context, the coherent information JO] will expressed in one of two ways. For a 
fixed joint state a, we write 

Ic{A)B) = H{B) - H{AB) = -H{A\B). 

Otherwise, if we are given a density matrix p^ and a channel JV : A' ^ B which give rise to a 
joint state (1^ (X) A/')($p), where j^p)"*"* is any purification of p, we will often use the notation 

IciA)B) = I,{p,Af) = H{U{p)) - H{{l®U){'^p)). 

It can be shown that this latter expression is independent of the particular purification |(I>p) that 
is chosen for p. 

Despite their distinct forms, the mutual and coherent informations do share a common feature. 
For a fixed input state, each is a convex function of the channel. We further remark that the 
quantity I(.{A)BX) can be considered as a conditional, or expected, coherent information, as 

h{A)BX)^ = Y,p{x)I{A)B)^^. 

X 
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A particular departure of this quantity from its classical analog, the conditional mutual information 
I{X; Y\Z), is that the latter is only equal to I{X; YZ) when X and Z are independent, while the 
former always allows either interpretation, provided the conditioning variable is classical. 

Conditional coherent information arises in another context; suppose that A/" : A' — > XB is a 
quantum instrument meaning that Af acts as 

AT: r^~>^|x)(a;|''®AA,(r). 

X 

The completely positive maps {Mx} sltc the components of the instrument. While they arc generally 
trace reducing, their sum Af = Afx is always trace preserving. It is not difficult to show that 

I,ip,Af)^lM)BX), 

where the latter quantity is computed with respect to to the state 

^p(x)|.T)(a;|^®(l^®AA,)($^^'). 

X 

For us, a quantum multiple- access channel is a channel Af : A' B' — > C with two inputs and 
one output. We will assume that the inputs A' and B' are under the control of Alice and Bob, 
respectively, and that the output C is maintained by Charlie. We will present three different 
quantum information processing scenarios which, as we will see, lead to equivalent cq and qq 
capacity regions. 

Classical-Quantum (cq) protocols These protocols will be relevant to Theorem ^ below. 
Using a large number n of instances of A/", Alice tries to send classical information to Charlie 
at rate r, while Bob simultaneously attempts to convey quantum information at rate S. Alice's 
communication is in the sense that she tries to send Charlie one of 2""" equiprobable classical 
messages, represented by the uniformly distributed random variable M . To this end, we allow 
her to prepare arbitrary pure states \4>m)^ at her input A'" to the channel. It is assumed that 
neither Alice nor Bob shares any additional resources with Charlie or among themselves, such as 
entanglement or noiseless quantum channels. We consider three different information processing 
tasks which Bob can perform, introduced in order of apparently increasing strength. The first 
two, entanglement generation and entanglement transmission, are outlined below, as each plays 
an essential role in the proof of our main result. The third, strong subspace transmission, is 
described in Section IKTI While not essential for the understanding of our main results, we include 
it in this paper because the composability properties implied by its more stringent constraints 
on successful communication make it particularly attractive as a building block for creating more 
intricate protocols from simpler ones. That each of these aforementioned scenarios can justifiably 
be considered as "sending quantum information" to Charlie will be proved in Sections 15 . 21 and l5 . 31 
where we will show that each gives rise to the same collection of achievable rates. 

I - Entanglement generation With the goal of eventually sharing near maximal entangle- 
ment with Charlie, Bob begins by preparing a bipartite pure state |T)'^^ , entangled between a 
physical system B located in his laboratory, and the B'" part of the inputs of J^f®"^ , Charlie's post- 
processing procedure will be modeled by a quantum instrument. While the outer bound provided 
by our converse theorem will apply to any decoding modeled by an instrument, our achievability 
proof will require a less general approach, consisting of the following steps. 

In order to ascertain Alice's message M, Charlie first performs some measurement on C" whose 
statistics arc given by a POVM {Km}me2-^^- We let the result of that measurement be denoted 
M, his declaration of the message sent by Alice. Based on the result of that measurement, he will 
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perform one of 2"'" decoding operatfons : C" B. These two steps can be mathematically 
combined to define a quantum instrument : C" — > MB with (trace-reducing) components 

The instrument acts as 

2"'' ^ 
T):t^Y^ |m)(m|*^®I?„(r), 

m—1 

and induces the trace preserving map T): C" B. acting according to 

2"'' 
m—1 

We again remark that this is the most general decoding procedure required of Charlie. Any 
situation in which he were to iterate the above steps by measuring, manipulating, measuring again, 
and so on, is asymptotically just as good as a single instance of the above mentioned protocol. 
({|<^m)}mG2"'^, jY^SB ^ -p) will bc callcd a (2"'', 2""^, n, e) cq entanglement generation code for the 
channel M if 

2-"'-5]PiK|T))>l-6, (1) 

m—1 

where 

Pi(m, |T)) = F (|m)|$)^^,I? o AA«"(0f • ® T^^'")) . (2) 

We will say that (r, S) is an achievable cq rate pair for entanglement generation if there exists 
a sequence of (2"'", 2"'^, n, e„) cq entanglement generation codes with e„ — > 0. The capacity region 
CQi{J\f) is defined to be the closure of the collection of all achievable cq rate pairs for entanglement 
generation. 

II - Entanglement transmission In this scenario, rather than generating entanglement 
with Charlie, Bob will act to transmit preexisting entanglement to him. We assume that Bob 
is presented with the B part of the maximally entangled state |$)^^. It is assumed that he has 
complete control over J5, while he has no access to B. He will perfo£m a physical operation in 
order to transfer the quantum information embodied in his system B to the inputs _B'" of the 
channel, modeled by an encoding operation £: B ^ B'". The goal of this encoding will be to 
make it possible for Charlie, via post-processing of the information embodied in the system C", 
to hold the B part of a state which is close to that which would have resulted if Bob had sent his 
system through a perfect quantum channel id: B —^ B. Here, we imagine that B and B denote 
two distinct physical systems with the same number of quantum degrees of freedom. The role of 
the identity channel is to set up a unitary correspondence, or isomorphism, between the degrees 
of freedom of B in Bob's laboratory and those of B in Charlie's. We will often tacitly assume 
that such an identity map has been specified ahead of time in order to judge how successful an 
imperfect quantum transmission has been. This convention will be taken for granted many times 
throughout the paper, wherein specification of an arbitrary state IvP)^^ will immediately imply 
specification of the state j^I')^-^ = (1^ (E) id)|\I')^-^. Decoding is the same as it is for scenario I. 

({|(/)m)}me2"'-,f,f ) will be called a (2"'', 2"^^, n, e) cq entanglement transmission code for the 
channel JV if 

2"'' 

2-^Y.Ps\^)>i-e, (3) 

m—1 
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where 

P]I(to) =F(|m)|$)^^,X)oAA«"(,^;^'"®£($^^)) . (4) 

Achievable rate pairs and the capacity region CQu{Af) are defined analogous to those for scenario I. 

Scenario III will be introduced in Sectional where it will also be shown that all three scenarios 
gives rise to the same set of achievable rates. For this reason, we will henceforth only speak of a 
single capacity region 

CQ{N) = CQiiM) = CQuiN) = CQmiN). 

Quantum-Quantum protocols The subject of Theorem|21 these protocols concern the case in 
which Alice and Bob each wish to send only quantum information to Charlie at rates R and S, 
respectively. As in the cq case, we will initially describe two different senses in which such a task 
can be considered. Again, Section [3 will introduce a third scenario, which will be shown to be 
equivalent to the following two. 

I - Entanglement generation For encoding, Alice and Bob respectively prepare the states 
ITi)-^^'" and [Ta)^^'", entangled with the A'" and S'" parts of the inputs of A/"*^". Then goal is 
to do this in such a way so that Charlie, after applying a suitable decoding operation V : C" AB, 
can hold the li? part of a state which is close to |$i)^^|$2)-^^. Formally, (|Ti)^^'" , jTa)-^^" , X>) 
is a (2"^, 2"^ , n, e) qq entanglement generation code for the channel A/" if 

F(|$i)|$2>,I?oAA«5"(Ti®T2)) > (5) 

As before, {R, S) is an achievable qq rate pair for entanglement generation if there is a sequence 
of (2"^, 2"'^, n, e„) qq entanglement generation codes with e„ 0, The capacity region Qi(A/') is 
the closure of the collection of all such achievable rates. 

II - Entanglement transmission Alice and Bob each respectively have control over the A 
and B parts of the separate maximally entangled states |$2)'^^, while neither has access 
to A or B. Alice transfers the correlations in her system to the A'" parts of the inputs of A/"*^" 
with an encoding operation £i: A ^ A'". Bob acts similarly with £2: B ^ Their goal is to 
preserve the respective correlations, so that Charlie can apply a decoding operation T): C" AB, 
in order to end up holding the AB part of a state which is close to |$i)'^'^|$2)^'^- Formally. 
{£i,£2,'D) is a (2"^, 2""^, n, e) qq entanglement transmission code for the channel A/" if 

F(|$i)|$2),2?oAA®"o(^^i«)f2)($i«)$2)) > (6) 

Achievable qq rate pairs for entanglement generation and the capacity region Qii(A^) are defined 
as in the previous scenario. As in the cq case, we defer to Section [S] the introduction of scenario 
III, as well as the proof that 

Q(AA) = Qi(AA) = Qu{N) = Qiii(AA). 

3 Main results 

Our first theorem gives a characterization of CQ{M) as a regularized miion of rectangles. 

Theorem 1. Given a quantum multiple access channel M : A! B' C , its cq capacity region 
CQ{M) is given by the closure of 

00 ^ 

M ^CS(i)(AA«'=), 

k=l 
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where CQ^^\A4) equals the pairs of nonnnegative rates (r, S") satisfying 

r < I{X;C), 
S < h{B)CX)„ 

for some pure state ensemble {p{x), \4'x)^ }xex o,nd a bipartite pure state j^*)^^ giving rise to 

a^BC ^J2p{x)\x){x\'' ^M{cl)x®^). (7) 

X 

Furthermore, it is sufficient to consider \X\ < mm{\A'\, |C|}^ + 1 when computing CQ^^\ 

The next theorem offers a characterization of Qi-hf) as a regularized union of pentagons. 

Theorem 2. Given a quantum multiple access channel M : A' B' C , its qq capacity region Q{Af) 
is given by the closure of 

k=l 

where Q^^'^{A4) equals the pairs of nonnegative rates {R,S) satisfying 

R < Ic{A)BC)a 
S < h{B)AC), 
R + S < Ic{AB)C)a 
for some bipartite pure states j'l'i)'^"^ and {'^2)^^ giving rise to 

<j^^^ ^ {1^^ <g,M){-^i<E)^2)- (8) 

We remark here that there does not appear to be any obstacle preventing application of the 
methods used in this paper to prove many-sender generalizations of the above theorems. For 
simplicity, we have focused on the situations with two senders. It should also be noted that the 
characterizations given in each of the above theorems do not apparently lead to a finite computation 
for determining the capacity regions, as neither admits a single-letter characterization in general. 
However, as an application, it is proved in Section 17.11 that the cq capacity region for a certain 
quantum erasure multiple access channel does in fact have a single-letter region, given by the set 
of all pairs of nonnegative classical-quantum rates (r, S) satisfying 

r < H{q) 

S < {l-2q)\ogd 

for some < g < 5. This region is pictured in Figure ^ for the case in which d=2. In Section Ol 
wc define a family of "collective phase flip channels" which, with probability p, flip the phases of 
both input qubits. We show there that the qq capacity region of such a channel is given by a single 
pentagon of nonnegative qq rates (i?, S) satisfying 

R < 1 
S < 1 
R + S < 2~H{p). 

The characterizations given in Theorems 1 and 2 are not the only possible ways to describe the 
corresponding regions. It is possible to prove coding theorems and converses for regularizations 
of distinct single-letter regions for each of CQ{M) and QiM). We refer the reader to Sectional 
for further discussion regarding other characterizations of these regions. We finally mention that, 
contrary to the corresponding result for classical multiple access channels, the regions of Theorems 
1 and 2 do not require convexification. That this follows from the multi-letter nature of the regions 
will be demonstrated in the appendix. 
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Figure 1: CQ(erasure channel) 

4 Proofs of Theorems 1 and 2 

We first collect some relevant results which will be used in what follows, starting with some 
relationships between our distance measures. If p and a are density matrices defined on the same 
(or isomorphic) Hilbert spaces, set 

F^F{p,a) andT= \p-a\i. 

Then, the following inequalities hold (see e.g. \14\ ) 

1-Vf < T/2 < VT^, (9) 
1-T < F < 1-TV4. (10) 

From these inequalities, we can derive the following more useful relationships 

F>l-e => T<2y^ (11) 
T<e F>l-e, (12) 

which are valid for < e < 1. Uhlmann |S] has given the following characterization of fidelity 

F{p,a) = max \{^,\<^>.)\' = max|(vl/,|$,)|2 

l*P>.l*<.) I*p> 

where the first maximization is over all purifications of each state, and the second maximization 
holds for any fixed purification of a. This characterization is useful in two different ways. 
First, for any two states, it guarantees the existence of purifications of those states whose squared 
inner product equals the fidelity. Second, one can derive from that characterization the following 
monotonicity property |12| associated with an arbitrary trace-preserving channel Af 

F{p,a) < F{N{p)M{<T))- (13) 

An analogous property is shared by the trace distance 

\p-a\^ > |AA(p)-AA(a)|i, (14) 

which holds even if J\f is trace-reducing. A simple proof for the trace-preserving case can be found 
in |14j . These inequalities refiect the fact that completely-positive maps are contractive and cannot 
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improve the distinguishability of quantum states; the closer states are to each other, the harder it 
is to tell them apart. We will often refer to either of these two properties as just "montonicity," as 
the particular one to be used will always be clear from the context. Another useful property will 
be the multiplicativity of fidelities under tensor products 

Fipi «) P2, CTi «) (72) = F{pi,(7i)F{p2,(72). (15) 

Since the trace distance is a norm, it satisfies the triangle inequality. The fidelity is not a norm, 
but it is possible to derive the following analog by applying @ and ((TUIl to the triangle inequality 
for the trace distance 



Fipi,P3) > 1 - 2 v/1 - F{p,,p2) - 2^1 - F{p2, P3). (16) 

It will be possible to obtain a sharper triangle-like inequality as a consequence of the following 
lemma, which states that if a measurement succeeds with high probability on a state, it will also 
do so on a state which is close to that state in trace distance. 

Lemma 1. Suppose p, cr, A G B{Ti.), where p and a are density matrices, and < A < 1. Then, 
TrAcT > TrAp- \p-(j\i. 

Proof: 

Tr Act = Tr Ap - Tr A{p - a) 

> Tr Ap - max 2 Tr A(p - cr) 

~ 0<A<1 

= TrAp- |p-cr|i, 

where the last equality follows from a characterization of trace distance given in ^1] . ■ 
Since F((f>,p) = Tr0p when </> is a pure state, a corollary of Lemma^is that 

F{cj),a)>Fi(b,p)~\p~a\i, 

a fact we will refer to as the "special triangle inequality." 

The following lemma can be thought of cither as a type of transitivity property inherent to any 
bipartite state with a component near a pure state, or as a partial converse to the monotonicity 
of fidelity. 

Lemma 2. For finite dimensional Hilbert spaces Ha and Hb, let |</))^ € Ua he a pure state, 
p^ G B{'Hb) a, density matrix, and Vl"^^ G B{TCa ® "He) a density matrix with partial traces 
= Trs and = Tya ^, for which 

f(</),r!^)>l-e. (17) 

Then 

F{(j) ^p,n)>l-\p-n^\i- 3e. 
Proof. We begin by defining the subnormalizcd density matrix uj via the equation 

(0® i)r2(0® 1) = (18) 

which we interpret as the upper- left block of fi. when the basis for Ha is chosen in such a way that 
10) = (1,0,..., 0)^. Notice that ^(0, Trs f7) = Ttuj = f. Writing the normalized state uj = uj/f, 
we see that it is close to uj in the sense that 

\uj~uj\i < e\uj\i 

< e. (19) 
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Now we write 



Tr ^{l® ® ® 1)(1 ® Vp) 



Trw'(l(g) Vp) 



Ti-yJ(f>(E>{y/pujy/p) 



Tr J ^ujy/p 



> 



V fF{uJ,p) 
v//(l-|c.-p|i). 



(20) 



The first line is the definition of fidelity and the third follows from (|18|) . The last equality relies 
on the fact that the fidelity, as we've defined it, is linear in either of its two inputs, while the 
inequality follows from ((Tn|) . 

Noting that ^l^ >w, we define another positive operator ui' = fl^ — ui, which by (|17(l satisfies 
Trtj' < e and can be interpreted as the sum of the rest of the diagonal blocks of 17. The trace 
distance in the last line above can be bounded via double application of the triangle inequality as 



This continuity lemma from jJB] shows that if two bipartite states are close to each other, the 
difference between their associated coherent informations is small. 

Lemma 3 (Continuity of coherent information). Let p'^^ and a'^^ be two states of a finite- 
dimensional bipartite system QR satisfying \p — (t|i < e. Then 



where H{e) is the binary entropy function. 

Next is Winter's "gentle measurement" lemma |16| . which implies that a measurement which 
is likely to be successful in identifying a state tends not to significantly disturb that state. 

Lemma 4 (Gentle measurement). Let H be a finite dimensional Hilbert space. Lf p E B{Ti) is 
a density matrix and A G B{Ti.) is nonnegative with spectrum bounded above by 1, then 



\p-Uj\l < \p- {p - Uj')\l + \{p- Uj') - Uj\l + - Lli\i 

< Ti-Lj' + \p-n^\^+e 

< |p-0^|^+2e, 

where the second line follows from H19|l . Combining l|2()|) with (|2H) . we obtain 



(21) 



F(0(gp,r!) > (l-e)(l-|p-r!^|i 
> 1 - |p- 0-^1^ - 3e. 



2e) 



\Ic{Q)R)p - Ic{Q)R)a\ < 2H{e) + 4\og\Q\e, 



Tr pA > 1 - e 



implies 
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Our coding theorems for multiple access channels will make use of existing coding theorems for 
single-user channels. {£,T>) is a (2"^,n, e) entanglement transmission code for the channel A/" if, 
for the 2"^ x 2"^^ maximally entangled state |$)"^"^, we have 

F(|$),I?oAA^"o£($)) > 1-e. 

A (2"^, n, e) random entanglement transmission code consists of an collection of deterministic 
(2"^, n, e) entanglement transmission codes {Ep, Dp) and a probability distribution P^, correspond- 
ing to a source of shared common randomness available to both sender and receiver. We will often 
omit the subscript, once the randomness of the code has been clarified, and it will be imderstood 
that £ and V constitute a pair of correlated random maps. 

Associated to a random code is its expected, or average code density operator E£(7r"^), which 
is the expectation, over the shared randomness, of the image of the maximally mixed state on A. 
Our reason for using random quantum codes will be to ensure that, on average, the input to J\f^"- 
is at least close to a n-fold product state. 

The proof of the existence of quantum codes achieving the coherent information bound is at- 
tributed to Lloyd 23, Shor ^S] and Devetak The following quantum coding proposition for 
single-user channels is proved in |19| and concerns the existence of random entanglement transmis- 
sion codes whose average code density matrix can be made arbitrarily close to a product state. 

Proposition 1. Given is a channel M : A' ^ B, a density matrix , and a number < R < 
Ic{p,M). For every e > 0, there is n sufficiently large so that there is a (2"^,n, e) random en- 
tanglement transmission code for M with an isometric encoder £ and average code density 
operator =W,£{t:^) satisfying 

I^?-P^"|i<e. 

Furthermore, given any particular isometric extension Uf^f. A' BE ofAf, it is possible to choose 
isometric extensions U!^ : i?" — > AF of the deterministic decoders so that 

F{\'^>)^^\X)^"^,U^oU^" o£''\<i>)'^^) >l-e (22) 

for some fixed pure state |A)^ ^ . 

Next, we state an average error version of the HSW Theorem for cq codes with codewords 
chosen i.i.d. according to a product distribution |21l I2()j . 

Proposition 2 (HSW Theorem). Given is a cq state = p{x)\x) {x\''^ ^ p'^ and a number 
< R < I{X]Q)„. For every e > 0, there is n sufficiently large so that if 2"^ codewords C = 
{X"(m)} are chosen i.i.d. according to the product distribution p{x'^) = Y\^^iP{xi), corresponding 
to input preparations 

Px^ = Pxx®---® Px^, 

there exists a decoding POVM {A^} on Q" , depending on the random choice of codebookC, which 
correctly identifies the index m with average probability of error less than e, in the sense that 

Ec 2-"^ PX"(,„) A™ > 1 - e. (23) 

m— 1 

Due to the symmetry of the distribution of C under codeword permutations, it is clear that the 
expectations of each term in the above sum are equal. In other words, 

Ec2-"« Trpx"MA™ =EcTrpx"(i)Ai, (24) 

m— 1 

SO we will later, without loss of generality, make the assumption that Alice sends codeword ]\I = 1 
during our analysis (see [52] for a detailed discussion in the classical case). 
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Proof of Theorem^ (converse) We prove in Section O that any rate pair which is achievable 
for entanglement transmission is also achievable for entanglement generation. For this reason, we 
use the latter scenario to prove the converse part of Theorem^ It should be noted that the reverse 
implication, namely that entanglement generation implies entanglement transmission, follows from 
the fact the outer bound to be proved next coincides with the inner bound obtained by the coding 
theorem below. 

Suppose there exists a sequence of (2"'', 2""^, n, e„) entanglement generation codes with e„ 0. 
Fixing a blocklength n, let {|0m)}™62"'-, |T)^^ " and T> comprise the corresponding cq entangle- 
ment generation code. The state induced by the encoding is 

2'"' 
m— 1 

After application of the decoding instrument X>: BM^ this state becomes 

An upper bound on the classical rate of the code can be obtained as follows: 

nr = H{M)n 

= I{M;M)n+HiM\M)n 

< I{M;M)n+H{ 

< I{M;C")^ + ne'^. 

The first inequality follows from Fano's inequality (see e.g. [21]) while in the second we use the 
Holevo bound |23| and define = ^ + T^n ■ The quantum rate of the code is upper bounded as 

Ic{B)C'"M)^ > Ic{B)BM)n 

> Ic{B)B)n 

> IciB)B)^-2H{en)-8nS^ 
= nb ~ ne„. 

Above, the inequalities are consequences of the data processing inequality ^Hl; the fact that con- 
ditioning cannot increase entropy (and thus cannot decrease coherent information) |14| . a combi- 
nation of LemmaEland Hll() . and the definition e" = ;^ + nS^JT^. The second justification can be 
considered as an alternative statement of the well-known strong subadditivity inequality 1241 . of 
which a recent simple proof can be found in |25) . Setting X = M, we have thus proven that 

r< i/(X;C")-He:„ S <-h{B)C^ X) ^ ,'1^ 
n n 

whenever {r, S) is an achievable cq rate pair for entanglement generation, where e^,e^ 0. It 
follows that for any achievable rate pair (r, S) and any 5 > 0, we have 

(r - (5,5 - (5) e -CQ(i)(AA®") c CQ(AA). 
n 

Since CQ{Af) is closed by definition, this completes the proof. 
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Proof of Theorem ^ (achievability) Our method of proof for the coding theorem will work 
as follows. We will employ random HSW codes and random entanglement transmission codes to 
ensure that the average state at the input of A/"®" is close to a product state. Each sender will 
utilize a code designed for the product channel induced by the other's random input, whereby 
existing coding theorems for product channels will be invoked. The quantum code used will be one 
which achieves the capacity of a modified channel, in which the classical input is copied, without 
error, to the output of the channel. As the random HSW codes will exactly induce a product state 
input, the existence of these quantum codes will follow directly from Proposition ^ 

The random HSW codes will be those which exist for product channels. As random entan- 
glement transmission codes exist with average code density matrix arbitrarily close to a product 
state, this will ensure that the resulting output states are distinguishable with high probability. 
Furthermore, obtaining the classical information will be shown to cause but a small disturbance in 
the overall joint quantum state of the system. As we will show, it is possible to mimic the channel 
for which the quantum code is designed by placing the identities of the estimated classical message 
states into registers appended to the outputs of each channel in the product. 

The decoder for the modified channel will then be shown to define a quantum instrument which 
satisfies the success condition for a cq entanglement transmission code, on average. This feature 
will then be used to infer the existence of a particular, deterministic code which meets the same 
requirement. 

Fix a pure state ensemble {p{x), } and a bipartite pure state 1*)^"^' which give rise to 

the cqq state 

w^^"*^ = ® (1^ ® AA)((/.f (g) *^"^'), 

X 

which has the form of Define pi = J2xPi^)4'x and = Tr^ 'I'. We will demonstrate 
the achievability of the corner point {I{X;C), Ic{B" )CX))^ by showing that for every e,S > 0, 
if r = /(X;C)^ - 5 and 1^3" )CX)^ - S, there exists a (2"^ 2"'^, n, e) cq entanglement 

transmission code for the channel Af, provided that n is sufficiently large and that S > 0. The rest 
of the region will follow by timesharing. 

For encoding, Alice will choose 2"'' sequences X"(m), i.i.d. according to the product distribution 
p{x"') = n"=i-P(^i)- As each sequence corresponds to a preparation of channel inputs \4im)^ = 
\4>Xi(m)) ® • ■ ■ ® \4>x„{m))^ the expected average density operator associated with Alice's input to 
the channel is precisely 

m— 1 a:" 

Define a new channel A/2 : B' ^ CX (which is also an instrument) by 

p^Yp{x)U{(I)x ®p)® |a;)(a;|'^. 

X 

This can be interpreted as a channel which reveals the identity of Alice's input state to Charlie, with 
the added assumption that Alice chooses her inputs at random. Alternatively, one can view this as 
a channel with state information available to the receiver, where nature is randomly choosing the 
"state" X at Alice's input. Observe that Ic{p2,J^2) = Ic{B" )CX). By Proposition^ there exists 
a (2"'^,ri,, e) random entanglement transmission code {£,!), (3} for the channel A/'2, with average 
code density operator = ¥,^£{77) satisfying 

\Q-pr\i<e. 

Now, by PropositionEl for the channel Mi : p ^-^ M{p® P2) which would result if Bob's average 
code density operator were exactly equal to p^", there exists a decoding POVM {^m\m£2^^ which 
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would identify Alice's index m with expected average probability of error less than e, in the sense 
that 

Ec2""'-^TrA,„T;„>l-e, 

m=l 

where 

By the symmetry of the random code construction, wc utilize (j24|l to write this as 

Ec TrAiT^ > 1 - e. 
Define the actual output of the channel corresponding to 71/ = m as 

as well as its extension 

where |$)-^^ is the maximally entangled state which Bob is required to transmit. Note that 

It follows from monotonicity of trace distance that 

\EpTi-T[\^ < e, 
which, together with Lemma ^ implies that 

Ec 2-"'' J2 Tr A™ E^ t„ = Ecp Tr Ain > 1 - 2e. 

rn — l 

This allows us to bound the expected probability of correctly decoding Alice's message as 

Ec/3Tr(l(8Ai)a > l-2e. (25) 

In order to decode, Charlie begins by performing the measurement {Am}mi£2"^- He declares 
Alice's message to be M = m if measurement result m is obtained. Charlie will then attempt 
to simulate the channel A/"^", by associating a separate classical register Xi to each channel 
J\f: A[ d in the product, preparing the states \Xi{m))-^% for each \ <i <n. Additionally, he 
stores the result of the measurement in the system M, his declaration of the message intended by 
Alice. This procedure results in the global state 

2"'" 
m— 1 

Let j;BC"X'^ _ rpj.__^ j£ Q]-^j^j.jjg -^ygj-g able to perfectly reconstruct Alice's classical message, F 
would instead be ^ 

^' = 6®|x"(l))(x"(l)|^"®|l)(l|^^ 

with T' = Trjj F'. When averaged over Alice's random choice of HSW code. T' is precisely equal 
to the state which would arise via the action of the modified channel A/'2 . This is because 

= Arf"o£:($), (26) 
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where we have written the state which results when AUce prepares (jjx" as 

However, our choice of a good HSW code ensures that he can almost perfectly reconstruct Alice's 
message. A consequence of this will be that the two states T and T' are almost the same, as we 
will now demonstrate. 

In what follows, we will need to explicitly keep track of the randomness in our codes, by means 
of superscripts which are to be interpreted as indexing the deterministic codes which occur with 
the probabilities Pc and Q/s. Rewriting H25|l as 



it is clear that we may write 



Y,PcQpTT{l(^A'i)f/ >l-2e, 

Cl3 



for positive numbers {ec/j} chosen to satisfy 



CP 



By the gentle measurement lemma. 



and thus, by the concavity of the square root function. 



E, 



■C/3 



Ai a 1 ® V Ai - e 



CP 

< 4Vi. 



/Af)ef''(l®A/A?)-e' 



(CP 



Along with (|25|1 and monotonicity with respect to Tr^^, this estimate allows us to express 

Ec/3|T~r|i < Ec/3|r-r'|i 

= Ec/3 



(27) 



2'" 



E, 



XI |(l ® v/A^) (l ® ^A^) 
.ct3 I i IX) V Ai) Ci f 1 ® - 



2"'' 



(l (g, VaT) (l ® v/aT) - a + Ec/3 E Tr(l (g> ArnKi 



m=2 



(28) 



provided that e < ^. Since the the entanglement fidelity is linear in I?(T), which is itself linear in 
T, we can also use the special triangle inequality to write 

FmM^cpV) = -F(|$),E^I?(EcT)) 

> F{\<S?),E0V{Ecr)) - |E^I?(EcT')-E^I?(EcT)|^. 
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Using our earlier observation from (|26|l and the definition of a (2"'^, n, e) entanglement transmission 
code, we can bound the first term as 

F(|$),I?(EcT')) = ^^(|$),I?oArf"of($)) 
> 1 - £• 

An estimate on the second term is obtained via 

|E^P(EcT)-E^P(EcT')li < E,, |P(Ec T) - I?(Ec T')li 

< E,3|EcT-EcT'|i 

< Ec/3|T-T'|, 

< 5V^, 

where first three lines are by convexity, monotonicity. and convexity once again of the trace norm. 
The last inequality follows from (|28|) . Putting these together gives 

Ec0F(|$),P(T)) > l-e-5V^ 

> 1 - 6V^. (29) 

At last, observe that the final decoded state fl (which still depends on both sources of randomness 
C and f3) is equal to 

implicitly defining the desired decoding instrument T> : C" — > BM. The expectation of (PJ can 
now be bounded as 

2"'' 



Ec0 2--^J2PsH^) = ^cpPl'il) 



m— 1 



= F{\im,Ecpn) 

> l-\Tv^gEcpT-\l){l\\^-3{l-Fm,V{T))) 

> l-2\/2^-18Ve 

> 1-21V^. 

The third line above is by Lemma [3 The first estimate in the fourth line follows from H25|) . while 
the second estimate is by (|29|l , together with Hll|) . We may now conclude that there are particular 
values of the randomness indices P and C such that the same bound is satisfied for a deterministic 
code. We have thus proven that {{\(f>m)}me2"^ , S ,T>) comprises a (2"*", 2"^, ti, 21v^) entanglement 
transmission code. This concludes the coding theorem. 



Proof of Theorem |2] (converse) Suppose that {R, S) is an achievable qq rate pair for entan- 
glement generation. By definition, this means that there must exist a sequence of (2"^, 2""^, n, e„) 
entanglement generation codes with e„ 0. Fixing a blocklcngth ?i, let |Ti)'^'* , |T2)^'^ and 
T) : C" AB comprise the corresponding encodings and decodings. Define 

^ABC" ^ (^AB ^ AA®»)(Ti (8 T2) 

to be the result of sending the respective A'"" and B'" parts of |Ti) and IT2) through the channel 
A/"**". Further defining 
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as the corresponding state after decoding, the entanglement fidelity of the code is given by 

Fab =F(|$i)®|$2),f^) (30) 

where |4>i)^^ and |<i>2)^^ are the maximally entangled target states. The sum rate can be bounded 
as 

h{AB)C^h > Ic{AB)AB)n 

> Ic{AB)AB)g.,^^,-2H{en)-8n{R + S)^, 

> n{R + S)-ne'^. 

The first step is by the data processing inequality. The second step uses Lemma Inland along 
with monotonicity applied to (|30|l . The last step has defined — ^ ~ + S),Je^ and holds 
because the binary entropy H{-) is upper bounded by 1. We can bound Alice's rate R by writing 

> h{A)AB)n 

> Ia{A)A)n 

> Ic{A)A)^, - 2iJ(e„) - 8nRy^ 

> nR ~ ne'„. 

The first three steps above are by data processing jJU]. The remaining steps hold for the same 
reasons as in the previous chain of inequalities. Similarly, Bob's rate also must satisfy 

nS <IciB)AC'')^+ne'„. 

Since e„ implies 0, this means that for every 6 > 0, any achievable qq rate pair (i?, S) 

must satisfy 

{R-5,S-S)e -Q^^^Af^") c Q{Af). 
n 

Since Q(A/') is closed by definition, this completes the proof. 



Remark Strictly speaking, the pair of nonnegative rates (i?, S) needs to be contained in some 
pentagon whose corner points (i/c(A)C'')<,, )AC'=)^) and (i/c(A)BC'=)<,, i/c(B)C'=^)„) 

are located in the upper right quadrant of R^, where a'^^'~^^ is some state of the form ©. For 
large enough n, the induced states u; in the above proof fulfill this role. To see this, note that 
an artifact of the steps which upper bound Alice's rate R is that ^Ic{A)BC^) > R — en and 
/c(A)C") > R — €„■ Since e„ 0, the right sides are eventually positive whenever R > 0. The 
similar steps which bound Bob's rate complete the argument. 

Proof of Theorem 121 (achievability) Fix bipartite pure states and |^'2)^ ^ which 

give rise to the state 

and define pi = Tr^^*!, = Tr^ ^2- Letting e, (5 > be arbitrary, we will show that there 
exists a {2^^', 2"'^, n, e) qq entanglement transmission code where 

R = Ic{A" )C)^ - 5 and S = Ic{B" )A"C)^ - 5 

provided that R,S>Q. Note that the rates in Theorem [21 will be implied by taking the channel 
to be A/''^'^, with uj^^^ defined similarly. 
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Let us begin by choosing an isometric extension Ujs/ : A' B' CE of M . Define the ideal 
channel Mi : A' ^ C which would effectively be seen by Alice were Bob's average code density 
operator exactly equal to " as 

TVi : T 7V(t (g) p2). 

We now use Uj\/ to define a particular isometric extension Uj\/^ : A' — > CE' of Mi, where E' = B"E, 
as 

Observe that Bob's fake input B" is treated as part of the environment of Alice's ideal induced 
channel. We then further define the channel M2 '■ B' A"G by 

M2 - T l-^7V(*l ®t). 

In contrast to the interpretation of Mi , this may be viewed as the channel which would be seen 
by Bob if Alice were to input the A' part of the purification |^'2)"^ of to her input of the 
channel and then send the A" system to Charlie via a noiseless quantum channel. As in the proof 
of Theorem^ Charlie will first decode Alice's information, after which he will attempt to simulate 
the channel M2, allowing a higher transmission rate for Bob than if Alice's information was treated 
as noise. Since quantum information cannot be copied, showing that this is indeed possible will 
require different techniques than were utilized in the previous coding theorem. Although ensembles 
of random codes will be used in this proof, we introduce the technique of coherent coding, in which 
we pretend that the common randomness is purified. The main advantage of this approach will be 
that working with states in the enlarged Hilbert space allows monotonicity to be easily exploited 
in order to provide the estimates we require. Additionally, before we derandomize at the end of 
the proof, it will ultimately be only Bob who is using a random code. Alice will be able to use 
any deterministic code from her random ensemble, as Charlie will implement a decoding procedure 
which produces a global state which is close to that which would have been created had Alice 
coded with the coherent randomness. To show this, we will first analyze the state which would 
result if both senders used their full ensembles of codes. Then we show that if Alice uses any code 
from her ensemble, Charlie can create the proper global state himself, allowing him to effectively 
simulate M2 and ultimately decode both states at the desired rates. 

By Proposition ^ for large enough n, there exists a (2"^, n, e) random entanglement trans- 
mission code {pi,£l,'D\) for the channel Mi, where R = Ic{pi,Mi) — 5 = Ic{A" )C) — 5. There 
similarly exists a (2"'^,ri, e) random entanglement transmission code (gm,^™!^™) fo^' ■^2, with 
S ~ Ic{p2,M2) — 5 = Ic{B" )A"C) — S. Proposition n] further guarantees that these codes can be 
chosen so that their respective average code density operators 

e m 

satisfy 

Ift-Pf"|i<e (31) 

and also that we may choose isometric extensions U^_^ : C" — > AF implementing the !){ from 
Alice's random code which satisfy 

F(\^i)^^\X)^^'\U^^oU^-o£i\'S>i)^^)>l^e (32) 

for every random code index i and the same fixed state |A)^^ . 

Let the code common randomness between Alice and Charlie be held between the systems La 
and Lc, represented by the state 

£ 
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defining a similar state ^^^^'^^^ for the Bob-Charlie common randomness. For convenience, let us 
further pretend that 71 is part of a pure state 



1 

e 

Similarly, let 72 by purified by |r2)^^^^^^^^'^ . Let us define the controlled encoding isometrics 
£1 : LaA LaA"" and £2 : MbB MbB"' as 

£i=J2 ® and £2 = Y1 ® ^2- 

I m 

The states which would arise if Alice and Bob each encoded coherently are 

IT2)™'" = £'2|r2)|<i>2) -^V9^|m)*^®£:2™|<i>2). 

Note that we have abbreviated L = LeLaLq and M = MeMbMc- As each \Ti) is a purification 
of Qi, together with ^M\ . Uhlmann's theorem tells us that there exist unitaries Vi : LA — > A"" and 
V2 : MB B"" such that 

F(K;|T,;),|*,)®") > 1-e. (33) 

Further define a corresponding controlled isometric decoder Ut>i ■ LcC" LcAF for Alice's code 
as 

I 

Let us now imagine that each of Alice and Bob encodes using the coherent common randomness, 
resulting in a global pure state Z^^"|Ti)|T2) on LAMBC'E". If Charlie then applies the fuU 
controlled decoder from Alice's code, the resulting global pure state would be 

^Q^LAAMBFE- ^^^^ o |Ti) IT2) . 

For each i, let us define an isometry : i?'" AAFE^ as 

^Ui^oU^"o£(i'i>^(d-) 
which we use to define the pure states 

These definitions allow us to express 

£ 

Further writing \\')FMBE'- ^ F2~^l^>^^ ""^"> tl^c following bound applies 

^^(i<f>i)-^^iAr*''''^M^^)) = i^(i<i>i)iAr™",o'iT2)) 

= i^(|$i)|A)^^""^",y2oO^|T2 

> 1 -2y'l -F(|$i)|A)-PS""£;"^c'£|^2)«?«) 

-2v/l-F(y2|T2>,|«'2>«") 

> l-2^l-F{\^,)\X)F^"\Ui^oU^-o£f\<i>,))-2V-e 

> 1 - 4V^. 
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Above, the first inequality is by the triangle inequality and monotonicity with respect to O^, while 
for the second inequality, we have just rewritten the first term and used H33|l for the second. 
The last bound is from H32(l . Observe that we are still free to specify the global phases of the 
outputs of the so that the above bound further imphes (6'^||$i)|A') > (1 — 4y^)^/^ for each £. 
Consequently, 



i^(|e),|ri)|$i)|A')) = J2V¥m^m'){0i\\<i>i)\x') 



> 1 - Ayfe. 

Essentially, the subsystems L, AA and MBFE" of |0) are mutually decoupled. 

As mentioned earlier, it will be sufficient for Alice to use any deterministic code from the 
random ensemble to encode. Without loss of generality, we assume that Alice chooses to use the 
first code (£ = 1) in her ensemble. Bob, on the other hand, will need to use randomness to ensure 
that Alice's effective channel is close to a product channel. The state on AMBC^ which results 
from these encodings is A/''*"(£i ($i) (8) T2). 

We will now describe a procedure by which Charlie first decodes Alice's information, then 
produces a state which is close to Q, making it look like Alice had in fact utilized the coherent 
coding procedure. This will allow Charlie to apply local unitaries to effectively simulate the channel 
Af2 for which Bob's random code was designed, enabling him to decode Bob's information as well. 
These steps will constitute Charlie's decoding T) : McC" MqAB, which depends on the Bob- 
Charlie common randomness. The existence of a deterministic decoder will then be inferred. 

Charlie first applies the isometric decoder U^_^, placing all systems into the state \9i). He 

then removes his local system A (it is important that he keep A in a safe place, as it represents 
the decoder output for Alice's quantum information) and replaces it with the corresponding parts 
of the locally prepared pure state Charlie also locally prepares the state iFi)^. The 

resulting state 



A°A° 



Tr 



AA ' 



satisfies 



F(e',e) > i-\Tr.,e, 



> 



> 



1 - 9ei/4 




3(i-F(|r)|$i),TrMi3Fij" e)) 



(34) 



whenever e < 12^'*. The first line combines Lemma 2 and the triangle inequality. The first two 
estimates in the second line are from applying and monotonicity with respect to Tr^^ and 
^LAA previous two estimates. The last estimate in that line is from monotonicity with 

respect to the map Ti' mbfe^ applied to the previous estimate. Next, Charlie will apply Vi to 
0' ^ in order to simulate the channel A/'2. To sec that this will work, define M : LAAFE" yl""C" 
as = Tte" Vi o and observe that by monotonicity with respect to ^®"( • (g) T2) and 

satisfy 



the states on A/BA""C" 

F{M{e),Aff-{T,)) 



> 
> 



F(yi|Ti),|vi/i)««) 

1 -e. 



T2),AA«"«"(gT2)) 



^This operation only acts on Charlie's local systems, i.e. Vi oUj,^ : LA°A°F A""-C^ 
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We may now use the triangle inequality and monotonicity with respect to M to combine our last 
two estimates, yielding 



i^(M(e'),AAf"(T2)) > 1-2^1-F {M{e'),M{e)) (A^(e), AAf "(T2)) 

> 1 - 2 V9ei/4 _ 2Ve 

> 1 - 7e^/^ (35) 

whenever e < 2^*/'^. We have thus far shown that Charlie's decoding procedure succeeds in 
simulating the channel A/"^", while simultaneously recovering Alice's quantum information. Charlie 
now uses the controlled decoder I?2 : McA""C" McB defined as 

m 

to decode Bob's quantum information. This entire procedure has defined our decoder V : McC"^ 
McAB which gives rise to a global state il,^^^^ representing the final output state of the protocol, 
averaged over Bob's common randomness. This state satisfies 

F(|$i),Tr^5r!) > F{e,e') 

> l-9ei/4, 

because of monotonicity with respect to Tt:lmbfe^ applied to the bound l|34(l . By using the 
triangle inequality, the fact (|22|l is satisfied for each m, and monotonicity of the estimate H35|) with 
respect to Tti\j , the global state can further be seen to obey 



F(|$2>,Tr^lf^) = F{\^2),T^rMV2oM{Q')) 



> l-2Jl-F(|$2),TrMl?2oAA2«"(T2))) 



2 Jl - F{TvM V2 o M^''{T2), Ttm V2 oM{e')) 



> l-2V^-2V7ei/8 

as long as e < 2~^^/^. Along with a final application of Lemma 2 combines the above two 

bounds to give 



Fi\^,)\<i>2),n) > l-|$i-Tr^gf7|^-3 l-F(|$2),Tr^^r!) 



> 1 - 2V9ei/4 - 21ei/i6 

> 1 - 22ei/i^ 

provided that e < 6~^^. Since this estimate represents an average over Bob's common randomness, 
there must exist a particular value m* of the common randomness so that the corresponding 
deterministic code is at least as good as the random one, thus concluding the coding theorem. 



5 Strong subspace transmission and scenario equivalences 
5.1 Scenario III - Strong subspace transmission 

The criteria of scenarios I and II, both in the cq and qq cases, arc directly analogous to the 
requirement in classical information theory that the average probability of error, averaged over all 



22 



codewords, be small. Here, we introduce a situation analogous to the stronger classical condition 
that the maximal probability of error be small, or that the probability of error for each pair of 
codewords be small. There are examples of classical multiple access channels for which, when each 
encoder is a deterministic function from the set of the messages to the set of input symbols, the 
maximal error capacity region is strictly smaller than the average error region |27) . However, it is 
known that if stochastic encoders arc allowed (see Problem 3.2.4 in (2H|)j the maximal and average 
error capacity regions are equal. 

It is well-known that randomization is not necessary for such an equivalence to hold for single- 
user channels, as Markov's inequality implies that a fraction of the codewords with the worst 
probability of error can be purged, while incurring a negligible loss of rate. The obstacle to 
utilizing such an approach for classical multiple access channels, and hence for quantum ones as 
well, is that there is no guarantee that a large enough subset of bad pairs of codewords decomposes 
as the product of subsets of each sender's codewords. 

As mentioned earlier in Section|21 a particularly attractive feature of the following two scenarios 
is their composability; when combined with other protocols satisfying analogous criteria, the joint 
protocol will satisfy similar properties. 

Ill - classical-quantum scenario Strong subspace transmission can be considered a more 
ambitious version of entanglement transmission, whereby rather than requiring Bob to transmit 
half of a maximally entangled state |(f>)-^^, it is instead required that he faithfully transmit the 
B part, presented to him, of any bipartite pure state j^f)^^, where \R\ can be any finite number. 
The reader should note that this constitutes a generalization of the usual subspace transmission 
P^ . as whenever |vl/)^^ = \i(;)^\(p)^ , this amounts to requiring that |(^) be transmitted faithfully. 
We further demand that the maximal error probability for the classical messages be small. 

As with entanglement transmission, Alice will send classical information at rate r by preparing 
one of 2"'' pure states {|0m)"* }me2"'-- As previously discussed, our more restrictive information 
transmission constraints can only be met by allowing Alice to employ a stochastic encoding. We 
assume that Alice begins by generating some randomness, modeled by the random variable X. To 
send message M — to, she prepares the state |/(to)) = \4>f(m)), where /(to) = fx(jn) is a random 
encoding function, depending on the £andomness in X. 

Bob will apply an encoding £ : B i?'", and Charlie will employ a decoding instrument 
"D: C" MB. These will be constructed by adding an additional layer of processing on top 
of the entanglement transmission codes which were proved to exist in the previous section. The 
success probability for the protocol, conditioned on to being sent and \'^)^^ being presented, can 
be expressed as 

pi"(TO,i*)) = f(i/(to))^|vi/)«s,i?oaa«"(0^;;)®£(m/«^))). 

We will say that (/, X, {\4>m)}me2"'- , £, T^) is a (2"'', 2"'^, n, e) cq strong subspace transmission code 
for the channel M if, for every to e 2"'' and every 

Ejf P™(to, I*)) > (36) 

The rate pair (r, S) is an achievable cq rate pair for strong subspace transmission if there is a 
sequence of (2"'", 2"'^, n, e„) cq random strong subspace transmission codes with e„ 0, and the 
capacity region CQiii{N) is closure of the collection of all such achievable rates. 

Ill - quantum-quantum scenario This scenario is the obvious combination of the relevant 
concepts from the previous scenario and the qq entanglement transmission scenario. Alice and 
Bob are respectively presented with the A and B parts of some pure bipartite states and 
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1^2)^^- As before, we place no restriction on \Q\ and other than that they are finite. They 
employ their respective encodings 8i and 82, while Charlie decodes with T). As in the above cq 
case, the structure of these maps will be more complicated than in the previous two scenarios. 
(fi,f2,2-') is then a (2"^, 2"'^, n, e) qq strong subspace transmission code if 

F (1*1)^^1^2)'^^, 2? oAA«"o(fi®£:2)(*?^®*^^)) > 1-e, (37) 

for every pair of pure bipartite states l^i)'^'^ and 1^2)''*^ • Achievable rates and the capacity region 
Qui (A/") are defined as in the cq case. 



5.2 Entanglement transmission implies entanglement generation 

Proof that CQu C CQi: Suppose there exists a (2"'', 2"^, rt, e) cq entanglement transmission 
code, consisting of classical message states {l^m)"^ "}m&2'^^, a quantum encoding map £ : B ^ B, 
and a decoding instrument T>: C" MB. Write any pure state decomposition of the encoded 

i 

Then, the success condition Q for a cq entanglement transmission code can be rewritten as 

2'"' 

1-e < 2-"'-^P,"(m) (38) 

m— 1 

= 2-"'- ^^(l*)''^,!?™ ° AA«5"(0r ® iT^P^y^)) (39) 

m— 1 i 

= J2p, (2-"'" E ^ (|1>)''^,2?m ° AA«"(0r ® T,))) (40) 



7n—l 



2--^Pi(m,|T,)) , (41) 



so that there is a particular value z* of i for which 



2-"^5^Pi(m,|T,.)) >l-e. 



m— 1 

"inr nnS 



Hence, ({|0m)}me2"'-, ITi-),!?) comprises an (2"*", 2""^ , n, e) cq entanglement generation code. 



Proof that Qu C Qi : Suppose there exists a (2"^, 2"'^, n, e) entanglement transmission code 
{£i,£2,'D) which transmits the maximally entangled states |$i),|$2)- As in the cq case, the 
encoded states can be decomposed as 

(l^®fl)($l) =;^p,Ti,; 

i 

and 

(l^(^£2)($2) =^g,T2,. 

j 
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The reliability condition ^ can then be rewritten as 

^p,<7,F(|$i)|$2),I?®AA®"(Ti,®T2j)) > 

ij 

which implies the existence of a particular pair of values of such that 

F(|$i)|$2),I?(8)AA«"(Ti,. ®T2,.)) > 
Hence, (|Tii.), \T2j*),'D) comprises a {2"^,2'^^ ,n,e) qq entanglement generation code. 

■ 

5.3 Entanglement transmission implies strong subspace transmission 

Proof that CQu C CQui- Suppose there exists a (2"'', 2"'^, n, entanglement transmission 
codes with classical message states {|(/)m)^ }me2"'-, quantum encoding E.B^B, and decoding 
instrument X*: C" MB with trace-reducing components {Vm ■ C" B}. 

Wc will initially prove the equivalence by constructing a code which requires two independent 
sources of shared common randomness X and Y. X is assumed to be available to Alice and to 
Charlie, while Y is available to Bob and to Charlie. Then, wc will argue that it is possible to 
eliminate the dependence on the shared randomness, by using the channel to send a neglibly small 
"random seed" , which can be recycled to construct a code which asymptotically achieves the same 
performance as the randomized one. 

Wc begin by demonstrating how shared common randomness between Alice and Charlie allows 
Alice to send any message with low probability of error. Setting fi = 2"'', let the random variable 
X be uniformly distributed on the set {1,...,^}. To send message M ~ m, Alice computes 
m' ^ m + X modulo fi. She then prepares the state |0,„') for transmission through the channel. 
Bob encodes the B part of |$)-^^ with £, and each sends appropriately through the channel. Charlie 
decodes as usual with the instrument 13. Denoting the classical output as M' , his declaration of 
Alice's message is then M — M' — X modulo /i. Defining the trace-reducing maps Mm : B ^ B 

by 

and the trace-reducing average map as 

1 ^ 

X: r ^ - VX,„(t), 
we can rewrite the success criterion ^ for entanglement transmission as 

which, together with implies that for the identity map id : B ^ B, 

\iM-id)m,<e. (42) 

The above randomization of the classical part of the protocol can be mathematically expressed 
by replacing the Mm with Mm+x- As tracing over the common randomness X is equivalent to 
computing the expectation with respect to X, we see that Kx Mm+x = M, or rather 

Ex Fm,Mrn+xm ^ Fm,Mm. 

It is thus clear that the maximal error criterion for the randomized protocol is equal to the average 
criterion for the original one. 
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We continue by randomizing the quantum part of the classicaUy randomized protocoL Setting 
d = 2"^ = \B\, let {Uy}y^d^ be the collection of Weyl unitaries, or generalized Pauli operators, on 
the (i-dimensional input space. Observe that for any p, acting with a uniformly random choice of 
Weyl unitary has a completely randomizing effect, in the sense that 

1 

Let the random variable Y be uniformly distributed on {1, . . . , d^}. It will be convenient to define 
the common randomness state 



d2 

where the system Yb is in the possession of Bob, while Yc is possessed by Charlie. Define now the 
controlled unitaries Ub ■ YbB YbB and Uc ■ YqB YqB by 



UB = Y.\y){y\''^ ®Uy 



and 

d" 

Uc^Y.\y)^y\'" ®^y'■ 
Suppose Bob is given the B part of an arbitrary pure state |^)^^, where \R\ < oo, and Alice sends 
the classical message M = m. For encoding. Bob will apply £oUb to the combined system T (g) 
Charlie decodes with Uq o 2?- If were equal to the perfect quantum channel id: _B — > S, this 
procedure would result in the state 



^Ei2/)(2/r"®i2/)(2/r"®*- 



Note that the common randomness is still available for reuse. Abbreviating = I?/) (2/1^^ ® 

\y){y\^'', and \^y)^^ = (1^ ® Uy)\^), we write 

a^^^ = Ub{T®-^) (43) 

d- 

= ^El2^)(yl''®*^- (44) 
y=i 

Observe that a is an extension of the maximally mixed state tt^, and can be seen to arise by 
storing in Y the result of a von Neumann measurement along the basis {\y)^ lysd^ on the R' part 
of the pure state 



Since TtrirT = Txyrct = tt^, |r) is maximally entangled between R' R and B. So, there exists 
an isometry V : B ^ R! R such that {V ® 1-^)|$)^-^ = |r). This imphes that there is a quantum 
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operation O: B ^ YR such that [O (S) 1^)(<I>) = a. Define the trace-reducmg map T: B ^ ]3, 
which represents the coded channel with common randomness accounted for, by 

T: T ^ TiyUc o M oUb{T ® t). 

Recalhng our denotation of the noiseless quantum channel id: B ^ B, as well as our convention 
that id acts as the identity on any system which is not B, we now bound 

i-F{\^),T{^)) < |(r-id)(*)|^ 

< \{UcoMoUb -id)(T(E)^)\^ 
= \{M-id)oUB{T(g>'^)\^ 

= \{M-id)ia)l 

< |(X-id)($)|^ 

< e, 

where the first line is by © and the second by monotonicity with respect to Try. The third 
follows from unitary invariance of the trace. The second to last inequality is a consequence of 
monotonicity with respect to O, while the last is by (|42|l . Note that by monotonicity, this implies 
that any density matrix ^l^^ satisfies 

\T{n)-n\i<e. (45) 

We have thus shown that if Alice and Charlie have access to a common randomness source of rate 
r, while Bob and Charlie can access one of rate 25", the conditions for strong subspace transmission 
can be satisfied. Next, we will illustrate that, by modifying our protocol, it is possible to reduce 
the amount of shared randomness required. Using the previous blocklength-n construction, we will 
concatenate N such codes, where each utilizes the same shared randomness, to construct a new 
code with blocklength nN. For an arbitrary , further define the commuting operations 

{Tijigjv, where %: Bi ^ Bi is T acting on the I'th tensor factor of Setting = we 

then recursively define the density operators S,i = 7i(^i_i), noting that ^at = T/v o ■ ■ ■ o 7l(^o) = 
r«'^(^'(^)). Because of (gS)), - U\i = - < e, and we can use the triangle 

inequality to estimate 

N 



< 



2 = 1 

< Ne. 

By choosing = it is clear that we have reduced Alice's and Bob's shared randomness rates 
respectively to y/er and 2^S, while the error on the A^-blocked protocol is now y/e. Next, we 
argue that by using two more blocks of length n, it is possible to simulate the shared randomness 
by having Alice send nr random bits X using the first block, while Bob locally prepares two copies 
of |(f>), written -82-82 ^ ^^j^^^ transmits the B1B2 parts over the channel using both blocks. 

Charlie decodes each block separately, obtaining a random variable X and the Bi and B2 parts of 
the post-decoded states Qf^^^ and $7^^^^. Bob and Charlie then measure their respective parts of 
r^i (g) hi^ some previously agreed upon orthogonal bases to obtain a simulation T of the perfect 
shared randomness state which, by monotonicity and telescoping, satisfies 

|T-f|i < \^ (g) ^ - Qi (g) n2\i 

< £2. 
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Further, the noisy shared randomness for the classical messages can be shown to satisfy 



|dist(X,X) - dist(X,X)|j 2Pt{X = X} 

< e\ 

By monotonicity of trace distance and the triangle inequality, using the noisy common randomness 
state T increases the estimate for each block by 2e^. For identical reasons, the same increase is 
incurred by using the noisy common randomness {X^X). Thus, accounting for both sources of 
noisy common randomness, the estimate ()45|l is changed to 2e, provided that e < i. The noisy 
common randomness thus increases the bound on the error of the A^-blockcd protocol to 2y^, while 
costing each of Alice and Bob a negligible rate overhead of in order to seed the protocol. 

The above protocol can be considered as defining an encoding map £' : Qi(N+2)n 
decoding instrument V : C(^+2)" B^M^. Thus, the protocol takes a (2"'', 2"'S', n, e„) cq en- 
tanglement transmission code and constructs a (2" ^ 2" ,n',e^j,) strong subspace transmission 

code with cq rate pair (r', S") = ^jq^T— , t^^t— ^ j where ?i' = ^2 + -^^^ n, and e^, = 2y/e^. Now, if 

the rates (r, S) are achievable cq rates for entanglement transmission, there must exist a sequence 
of (2"'', 2""^, n, 2e^) entanglement transmission codes with e„ — > 0. Since this means that ^^^^^ 
increases to unity, we have shown that for any 5 > 0, every rate pair {r — 6, S ^ S) is an achievable 
cq rate pair for strong subspace transmission. Since the capacity regions for each scenario are 
defined as the closure of the achievable rates, this completes the proof. 



Proof that Qu C Qm: We will employ similar techniques as were used in the previous proof 
to obtain this implication. Suppose there exists a (2"^, 2"^^, ti, ^e^) qq entanglement transmission 

code (Si, £2, V), with £1: A'", £2: B ^ B'", and P : C" ^ AB. Setting a = |I| = 2"-" and 
b = \B\ = 2"'^, define the common randomness states 



x=l 



and 



1 

x=l 



These states will be used as partial inputs to the controlled unitaries 

x = l 

Uc = Y.\''){xf- ®U-\ 

x=l 

}? 
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where, as before, we have utihzcd the Weyl imitaries {Ux}xea^ a-nd {V^y}ygb2, which respectively 
completely randomize any states on a-dimensional and 6-dimensional spaces. Suppose Alice and 
Bob are respectively presented with the A and B parts of the arbitrary pure states j^'i)'^"* and 
1^-2)-^^. Writing M=Vo TV®" o {£1 (g) £2), and defining the map T : AB AB hy 

T : T ^ {Uc ®Vc) o M o {Ua® Vb){t Ti ® T2), 
the overall joint state of the randomized protocol is given by T{'^i ® '^2)- Abbreviating 

\xy){xyf'' = \x){xf- ® \x){xf^ ® ® \y){y\''^ 

and defining l*^)^^ = (l'^ ® Ux)\^i), Y^y)'^^ = (1^ ® Vv)Y^2), we write 

^XYQRAB ^ -1^ ^ \^y){:cy\ ® vj/^ ® ^>y. 
xy 

By similar arguments as in the cq case, there exists a map O : AB XYQR so that 

Again, for the same reasons as in the cq case, we have 

|(r-id)(1'i®*2)|i < \{M-id)ia)\i 

< -id)($i ® $2)|i 

< e. 

The rest of the proof is nearly identical to that from the previous section, so we omit these details, 
so as not to have to repeat our previous arguments here. ■ 

5.4 Strong subspace transmission implies entanglement transmission 

Proof that CQm C CQu- Given a strong subspace transmission code, if Alice uses any 
deterministic value x for her locally generated randomness X, the average classical error will be 
equal to the expected maximal classical error of the randomized code. Since the ability to transmit 
any state includes the maximally entangled case, this completes the claim. 

Proof that Qm C Qu: This implication is immediate. As any states can be transmitted, this 
certainly includes the case of a pair of maximally entangled states. 

6 Discussion 

There have been a number of results analyzing multiterminal coding problems in quantum Shannon 
theory. For an i.i.d. classical-quantum source XB, Devetak and Winter [30] have proved a Slepian- 
Wolf-like coding theorem achieving the cq rate pair {H{X\B), H[B)) for classical data compression 
with quantum side information. Such codes extract classical side information from i?" to aid in 
compressing X". The extraction of side information is done in such a way as to cause a negligible 
disturbance to B^ . Our Theorem is somewhat of this flavor. There, the quantum state of C"" 
is measured to extract Alice's classical message which, in turn, is used as side information for 
decoding Bob's quantum information. Analogous results to ours were obtained by Winter in his 
analysis of a multiple access channel with classical inputs and a quantum output, whereby the 
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classical decoded message of one sender can be used as side information to increase the classical 
capacity of another sender. 

We further mention the obvious connection between our coding theorems and the subject of 
channel codes with side information available to the receiver. The more difficult problem of classical 
and quantum capacities when side information is available at the encoder is analyzed by Dcvctak 
and Yard in |40| , constituting quantum generalizations of results obtained by Gclfand and Pinsker 
|31j for classical channels with side information. 

In an earlier draft of this paper, we characterized Qi-hf) as the closure of a regularized union 
of rectangles 

< i? < tIc{A)C'') 
k 

< 5 < 

This solution had been conjectured on the basis of a duality between classical Slepian-Wolf dis- 
tributed source coding and classical multiple-access channels |2H1 112] , as well as on a purported 
no-go theorem for distributed data compression of so-called irreducible pure state ensembles that 
appeared in an early version of |32j . After the earlier preprint was made available, Andreas Winter 
announced \6'6\ recent progress with Jonathan Oppenhcim and Michal Horodecki |34| on the quan- 
tum Slepian-Wolf problem, offering a characterization identical in functional form to the classical 
one, while also supplying an interpretation of negative rates and apparently evading the no-go 
theorem. Motivated by the earlier mentioned duality, he informed us that the qq capacity region 
could also be characterized in direct analogy to the classical case. Subsequently, we found that we 
could modify our previous coding theorem to achieve the new region, provided that the rates are 
nonnegative. After those events unfolded, the authors of found an error in the proof of their 
no-go theorem, leading to a revised version consistent with the newer developments. Our earlier 
characterization of Q{M)^ while correct, is contained in the rate region of Theorem 2 for any finite 
fc, frequently strictly so. The newer theorem, therefore, gives a more accurate approximation to 
the rate region for finite k. In fact, for any state arising from the channel which docs not saturate 
the strong subadditivity inequality |35) , the corresponding pentagon and rectangle regions arc dis- 
tinct. Another beneficial feature of the new characterization is that it is possible to show that the 
maximum sum rate bound R + S < maxIc{AB )C) is additive, where the maximization is over all 
states of the form © , for any channel which is degradable in the sense of |37| . 

More recently, we discovered that the same technique used to prove the new characterization of 
Q(A/') implies a new cq coding theorem, and thus a new characterization of CQ{M). By techniques 
nearly identical to those employed in the coding theorem for Theorem 2, it is possible to achieve 
the cq rate pair 

{r,S) = {l{X-BC),h{B)C)) 

corresponding to Bob's quantum information being used as side information for decoding Alice's 
classical message. This is accomplished by having Charlie isometrically decode Bob's quantum 
information, then coherently decode to produce an effective channel Ni : A' EC so that Alice 
can transmit classically at a higher rate. The new characterization is then a regularized union of 
pentagons, consisting of pairs of nonnegative rates (r, S) satisfying 

r < I{X;BC) 
S < Ic{B)CX) 
r + S < I(X;C) + Ic{B)CX)^I{X;BC) + Ic{B)C). 

Surprisingly, it is thus possible to characterize each of CQ{Af) and Q{Af) in terms of pentagons, 
in analogy to the original classical result. This situation makes apparent the dangers of being 
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satisfied with regularized expressions for capacity regions. Without being able to prove single- 
letterization steps in the converses, it is hard to differentiate which characterization is the "right" 
one. While it is intuitively satisfying to see analogous formulae appear in both the classical and 
quantum theories, the regularized nature of the quantum results blurs the similarity. Indeed, the 
problems with single-letterization for single-user channels appear to be amplified when analyzing 
quantum networks (see e.g. Perhaps this indicates that the necessity of understanding the 

capacities of single-user channels at a level beyond regularized optimizations is even more pressing 
than previously thought. We should mention that for the erasure channel analyzed in Section mi 
the newer description of CQ{M) is not an issue, as the new corner point is contained in the old 
rectangle for any state arising from any number of parallel instances of the erasure channel. On the 
other hand, we demonstrate in [211 that for the collective phase flip channel, both characterizations 
single-letterize, yielding a classical-quantum region which is identical in form to that obtained for 
Q{M) in Section 17721 replacing the quantum rate with a classical one. 

Consider the full simultaneous classical-quantum region S{N) for two senders, where each sends 
classical and quantum information at the same time. A formal operational definition of S{M) is 
found in |38l I39| . This region can be characterized in a way that generalizes Theorems 1 and 2 
as the regularization of the region S^^\N)^ defined as the vectors of nonnegativc rates (r, s, i?, S) 
satisfying 



r 


< 


I{X;C\Y) 


s 


< 


I{Y;C\X) 


r + s 


< 


I{XY-C) 


R 


< 


Ic{A)BCXY) 


S 


< 


Ic{B)ACXY) 


R+S 


< 


Ic{AB)CXY) 



for some state of the form 

^XYABC ^ ^p(^)p(y)|^)(^|^ ^ \y)l^y\y ® M {i^^^' ® <t>^ ) , 

arising from the action of N on the A' and B' parts of some pure state ensembles {p{x)^ \i^x)^^ }, 
{p(y), }• Briefly, achievability of this region is obtained as follows. Using techniques intro- 

duced in 127], each sender "shapes" their quantum information into HSW codewords. Decoding is 
accomplished by first decoding all of the classical information, then using that information as side 
information for a quantum decoder. The main result of |37j . the regularized optimization of the 
cq result from over pairs of input ensembles, and our Theorems 1 and 2 follow as corollaries 
of the corresponding capacity theorem. Indeed, the six two-dimensional "shadows" of the above 
region, obtained by setting pairs of rates equal to zero, reproduce those aforementioned results. 
This characterization, however, only utilizes the rectangle description of CQ{M). It is indeed pos- 
sible to write a more accurate regularized description of S{M) which generalizes the pentagon 
characterizations of CQ{Af) and Q{Af), although we will not pursue that at this time. 
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7 Appendix 



7.1 Proof of additivity of CQ for quantum erasure multiple access chan- 
nel 

Due to the regularized form of our Theorems 1 and 2, the possibiUty of actuaUy computing the 
capacity regions seems generally out of reach. Here we give some examples of channels whose 
capacity region does in fact admit a single-letter characterization, in the sense that no regularization 
is necessary. 

Our first example is a multiple access erasure channel Af: A'B' C, where \A'\ — 2, \B'\ = d 
and |C| = d + 1. Alice will send classical information while Bob will send quantum. Fixing bases 
{|0)^', |d)^'},{|0)'^,. .. , the channel has d + 1 operation elements 

d 

i=i 

TV,; = \zf{l\^'{zf, i=l,...d. 

The action of the channel can be interpreted as follows. First, a projective measurement of Alice's 
input along {|0), |1)} is performed. If the result is 0, Charlie's output is prepared in a pure state 
|0). Otherwise, Bob's input is transferred perfectly to the remaining degrees of freedom in Charlie's 
output. Bob's input is "erased", or otherwise ejected into the environment, whenever Alice sends 
|0), and is perfectly preserved when she sends Indeed, the action of A/" on (g) is given by 

AA(r®p) = Too|0)(0| +Tiip. 

In the sense of (|7)l, any state which arises from A/'^'^ can be specified by fixing some 

pure state ensemble {p{x), \(t)x)^ } and a pure bipartite state j'l')^^ . We thus write 

n = ^p(a;)|a;)(x|^ (1^ AA®'=)((^, ® 

X 

For a binary string y'^, let ly'^)^ — • • • \yk)^ be the associated computational basis state. 

Writing p{y''\x) = \{y''\(f'x)\'^ defines the random variable Y'', which is correlated with X, and can 
be interpreted as the erasure pattern associated with the state ^. We next define another state of 
the form 0, 

for 

jk 

where the summation is over d-ary strings of length k, — [ji, . . . ,jk)- Finally, for 

q, = Pr{l-=0}, 
1 

2 — 1 
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define a third state 



o;^^^ = q\0){0f ® ® |0)(0r + (1 - 9)|l>(ir ® 
The above states can easily be seen to satisfy the following chain of inequalities 

IiX;C% = IiX;C%, 

< H{Y%r 

k 

i=l 
k 

i=l 

< kH{q) 
= kH{U)^ 

= kI{U;C)^. 

The only nontrivial step above is that we have used the concavity of the binary entropy function 
in the last inequality. Furthermore, it is not hard to see that 



h{B)C''X)n < Ic{B)C''XY'')n' 



k\ 

Thus, we have shown that for any state fl^^'^'^ arising from J\f'^'^ in the sense of (Q, there is a 
state u)^^'-' arising from M in the same sense, allowing the multi-letter information quantities to 
be bounded by single-letter information quantities; i.e. CQ{N) — CQ^^\N). ■ 
As it is clear that /([/; C)^ = H{q), we focus on calculating 



BC 



h{B )CU)^ = q (i/(|0) (0|^) - HinU ® |0) (0|^)) + (1 - g) [H{n^) - 

^ <z(0-logd) + (l-q)(logd-0) 
= (l-2g)logd. 

Note that the above quantity is a weighted average of a positive and a negative coherent infor- 
mation. It is perhaps tempting to interpret these terms as follows. The positive term can be 
considered as resulting from a preservation of quantum information, while the negative term can 
be seen as signifying a complete loss of quantum information to the environment. The overall 
coherent information is positive only when g < i , a result which is in agreement with the result of 
Bennett et al. on the quantum capacity of a binary erasure channel. Varying < g < i, the 
rate pairs 

ir,S) = {mC),I,iB)CU))^ 
= {Hiq), il-2q) log d) 

can be seen to parameterize the outer boundary of CQ{M), as is pictured in figure ^ for the case 
d = 2. 

As an aside, we remark that this calculation, together with the quantum channel capacity 
theorem from |19| . gives a direct derivation of the quantum capacity of a quantum erasure channel, 
without relying on the no-cloning and hashing arguments used in j41| . 
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7.2 Proof of additivity of Q for collective phase flip channel 

While the description of the capacity region Q in Theorem 2 generally requires taking a many-letter 
limit, we give here an example of a quantum multiple access channel Afp : A'B' C for which that 
description can be single-letterized. The channel Afp takes as input two qubits, one from Alice and 
the other from Bob. With probability p, the channel causes each qubit to undergo a phase flip, by 
rotating the state of each by 180° about its z-axis on the Bloch sphere, before it is received by the 
receiver Charlie. The action of Afp on an input density operator ^ is described in terms of the 
operator sum representation as 

Afpip) = (1 - p)p + p{(7z ® crz)p{crz ® (7z), 

where az is the Pauli phase flip matrix. We will demonstrate that Q{Afp) is equal to the collection 
of all pairs of nonnegative rates (i?, S) which satisfy 

R < 1 
S < 1 
R + S < 2-H{p). 

Proof. We first observe that for any input state of the form ^Ic{AB)C'') is upper bounded 
by the quantum capacity of Afp when both senders may act together. Since Afp is a generalized 
dephasing channel |37| . its quantum capacity can be calculated by a single- letter optimization of 
the coherent information over input density operators which are diagonal in the dephasing basis 
which, incidentally, is just the computational basis. A short calculation reveals that it suffices 
to check inputs of the form p„ = i(a(|00)(00| + |11)(11|) + (1 - a)(|10)(10| + |01)(01|)) when 
computing the quantum capacity of Afp. Furthermore, since Afp is also degradable |37) . Ic{p,Afp) 
is concave as a function of p (see Section 17.31 for a simple proof). Since IdPonAfp) = Ic{pi-onAfp), 
one then concludes by symmetry that the maximum is achieved for a ^ i, in which case pj. is 

maximally mixed, yielding maxp Ic{p,Afp) = Ic{t^^'^' Mp) = 2 - H{p). Note that this maximizing 
input is a product state, since tt^ ^ = tt^ ® ir^ . Define the Bell states 

IV'±) = ^(|oo)±|ii; 

As purifies the maximally mixed state 7r2, let us write the global state 

Identifying C = AB in the obvious way, we reexpress 

L.^^^^ = (1 - p)i>i^ ® ^f" + ^ ® 
It is now a simple task to calculate 

H{ABC) = H{uo) = H{p) 
H{C) = H{tt^)^2 
H{AC) = H{AA) + H{B) = H{p) + 1 = H{BC). 
Combining these gives the relevant coherent informations 

Ic{AB)C) = H{C) - H{ABC) = 2- H{p) 

Ic{A)BC) = H{BC) - H{ABC) = 1 + H{p) - H{p) = 1 

Ic{B)AC) = H{AC) - H{ABC) = 1. 

Clearly Ic{A)BC) < log \A'\ = 1 and Ic{B )AC) < log \B'\ = 1 for any state arising from Afp. The 
individual rate bounds are thus saturated and the claim follows. ■ 
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7.3 Concavity of coherent information for degradable channels 

Recall that a given a channel M : A' B, one may define a complementary channel Afc : A' ^ E 
as TtbU, where U: A' ^ BE is any isometric extension of Af. The channel Af is then said to be 
degradable if there exists a channel D: B ^ E which degrades M to Afc, in the sense that 
Mc = I? o J\f. A single- letter description of the quantum capacity of any such channel was given in 
P7] as maxp Ic{p,M). We will show that if JV is degradable, then Ic{p,M) is a concave function of 
P- 

Proof. Fixing density matrices p^' and p^', we write p^^' = Ao|0)(0|^ po + Ai|l)(l|^ (X) pi, 
where < An, Ai satisfy Aq + Ai = 1. Setting cr'^^^ = U{p^^') and a"^^'^ = V(Tr£;CT), where 
V: B ^ E'F isometrically extends the associated degrading map V, we write 

Ao/c(po,AA)+Ai/e(pi,AA) = H{B\U)^-H{E\U)^ 

= H{FE'\U)a' - H{E'\U),^ 

= H{F\E'U),, 

< H{F\E\, 

= H{B)„-H{E), 

= /c(Ao/5o + Aipi,7V). 

Here, we used the fact that V preserves entropies and the form of strong subadditivity which states 
that conditioning cannot increase entropy. This proves the claim. ■ 

7.4 Proof of convexity of CQ and Q 

Let M : A'B' ^ C be a quantum multiple access channel. We will prove that Q{Af) is convex, as 
the proof for CQ is identical. 

Proof. Let fco and fci be positive integers, and fix any two states of the form o'o" " and 
^MB^c"^ rj.^^^ ^^^^ g^-^^ S'l) G Q{N), where for i e {0, 1}, 

R = ^Ic{A,)C''^),, 
ki 

We will now show that for any rational < A < 1, A(i?o, ^o) + (1 - X){Ri,Si) G Q(A/'). We first 
write A = ^, for integers satisfying /3 > 0, > a > 0. Setting po ~ aki, pi ~ {P — Q:)fco, and 
k — poko +piki, define the composite systems A = A^°A^^ and B = B^'^'B^^ , as well as the density 
matrix o"^^'-^ — a®^° ® erf , which is also of the form ^ . Additivity of coherent information 
across product states and some simple algebra gives 

k k K 

_ pokpRp +pikiRi 

Paka +piki 

= Ai?o + (l-A)i?i. 

An identical calculation shows that ^Ic{B)C^)„ = XSq + (1 — A)iS'i. As Q{Af) was defined as the 
topological closure of rate pairs corresponding to states which appropriately arise from the channel, 
the result follows because the set of previously considered A's comprises a dense subset of the unit 
interval. ■ 
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7.5 Proof of cardinality bound on X. 

Begin by fixing a finite set X, a labelled collection of pure states }xex, and a pure bipartite 

state I^I')-^^ . For each x, these define the states a^'~"' = N{4)x ® and lo^ = Tr^ (Jx- Assume for 
now that > |C|. Define a mapping f : X ^ RI<^I'+\ via 

/: a-K^ = K,H(c^,),/,(i?)C),J, 

where we identify lOx with its |Cp — 1 dimensional parameterization. By linearity, this extends to 
a map from probability mass functions on X to mI*-^' where 

/: p{x) ^ ^p(x)/, = {u;p,H{C\X)p,h{B)CX)p), 

X 

Our use of the subscript p should be clear from the context. The use of Caratheodory's theorem 
for bounding the support sizes of auxilliary random variables in information theory (see |28| 'l is 
well-known. Perhaps less familiar is the observation |42l 1^ that a better bound can often be 
obtained by use of a related theorem by Fenchel and Eggleston which states that if S* C W 
is the union of at most n connected subsets, and if y is contained in the convex hull of 5*, then y 
is also contained in the convex hull of at most n points in S. As the map / is linear, it maps the 
simplex of distributions on X into a single connected subset of R'*-^' Thus, for any distribution 
p(x), there is another distribution p'{x) which puts positive probability on at most |Cp + 1 states, 
while satisfying f{p) ~ f{p')- If it is instead the case that \A'\ < \C\, this bound can be reduced 
to I A' p + 1 by replacing the first components of the map / with a parameterization of 0^ , as 
specification of a density matrix on A' is enough to completely describe the resulting state on C. 
It is therefore sufficient to consider \X\ < min{|A'|, |C|}^ + 1 in computing CQ'-^^Af). 



References 

[1] R. Ahlswede, "Multi-way communication channels," Second Intern. Sympos. on Inf. Theory, 
Thakadsor, 1971, Publ. House of the Hungarian Adad. of Sciences, 23-52, 1973. 

[2] Liao, "Multiple access channels", Ph.D. dissertation, Dept. of Electrical Engineering, Univer- 
sity of Hawaii, 1972. 

[3] T. Cover, A. El Gamal, M. Salehi, "Multiple-access channels with arbitrarily correlated 
sources," IEEE Trans. Info. Th. vol. 26, no. 6, November 1980. 

[4] P. W. Shor, J. A. Smolin, "Quantum error-correcting codes need not completely reveal the 
error syndrome," [quant-ph/9604006| 

[5] A. Winter, "The capacity of the quantum multiple access channel," IEEE Trans. Info. Th., 
vol. IT-47, pp. 3059 - 3065, November 2001. 

[6] G. Klimovitch, "On the classical capacity of a quantum multiple access channel," Proc. IEEE 
Intern. Sympos. Inform. Theory, Washington D.C. June 2001, pg. 278. 

[7] G. Klimovitch, A. Winter, "Classical capacity of quantum binary adder channels," 
[quant -ph/0502055 

[8] A. Uhlmann, "The 'transition probability' in the state space of a *-algebra," Rep. Math. Phys., 
9:273-279, 1976. 



36 



[9] I. Devetak, A. Winter, "Distilling common randomness from bipartite quantum states," 
|quaiit-ph/0304196{ 2003. 

[10] B. Schumacher, M. A. Nielsen, "Quantum data processing and error correction," Phys. Rev. 
A, 54(4)2629, 1996. 

[11] E. B. Davies, J. T. Lewis, "An operational approach to quantum probability," Comm. Math. 
Phys., vol. 17, pp. 239-260, 1970. 

[12] H. Barnum, C. M. Caves, C. A. Fuchs, R. Josza, B. Schumacher, "Noncommuting mixed 
states cannot be broadcast," Phys. Rev. Lett. , 76(15):2818-2821, 1996. 

[13] M. B. Ruskai, "Beyond strong subadditivity: improved bounds on the contraction of general- 
ized relative entropy," Rev. Math. Phys. , 6(5A) 1147-1 161 1994. 

[14] M. Nielsen, I. Chuang, "Quantum Information and Quantum Computation," Cambridge Uni- 
versity Press, 2001. 

[15] R. Alicki, M. Fannes, "Continuity of quantum conditional information," J. Phys. A, vol. 37, 
pp. L55-L57, January 2004. 

[16] A. Winter, "Coding theorem and strong converse for quantum channels," IEEE Trans. Info. 
Th., vol. 45, no. 7, Nov. 1999. 

[17] S. Lloyd, "Capacity of the noisy quantum channel," Phys. Rev. A, vol. 55, no. 3, pp. 1613- 
1622, 1997. 

[18] P. Shor, "The quantum channel capacity and coherent information," lecture notes, MSRI 
Workshop on Quantum Computation, 2002. 

Available at http : / / www . msri . org/publications/ln/msri/2002/ queLntumcrypto/ shor/ 1/| 

[19] L Devetak, "The private classical information capacity and quantum information capacity of 
a quantum channel," IEEE Trans. Info. Th. vol. IT-55, pp. 44-55, January 2005. 

[20] A. S. Holevo, "The capacity of the quantum channel with general input states," IEEE Trans. 
Info. Th. vol. 44: 1^0. 1, p. 269. 

[21] B. Schumacher, M. D. Westmoreland, "Sending classical information via noisy quantum chan- 
nels," Phys. Rev. A, vol. 56, no. 1, p. 131. 

[22] T. Cover, J. A. Thomas, Elements of information theory, John- Wiley & Sons, Inc., 1991. 

[23] A. S. Holevo, "Bounds for the quantity of information transmitted by a quantum channel," 
Prohl. Inf. Transm., vol. 9, pp. 177-183, 1973. 

[24] E. H. Lieb, M. B. Ruskai, "Proof of the strong subadditivity of quantum-mechanical entropy," 
J. Math. Phys. 14, pp. 1938-1941, Dec. 1973. 

[25] M. Nielsen, D. Petz, "A simple proof the strong subadditivity inequality," |quant-ph/0408130| 

[26] M. Koashi, A. Winter, "Monogamy of entanglement and other correlations," 
Phys. Rev. A 69(2):022309, 2004. 

[27] G. Dueck, "Maximal error regions are strictly smaller than average error regions for multi-user 
channels," Prohl. Contr. Inform. Theory, vol. 7, pp. 11-19, 1978. 

[28] I. Csiszar, J. Korner, Information Theory: Coding Theorems for Discrete Memoryless Systems, 
Akademiai Kiado, Budapest. 



37 



[29] H. Barnum, E. Knill, M. Nielsen, "On quantum fidelities and quantum capacities," IEEE 
Trans. Info. Th., vol. 46, no. 4, July 2000. 

[30] I. Devetak, A. Winter, "Classical data compression with quantum side information," Phys. 
Rev. A, vol. 68, pp. 042301-042306, October 2003. 

[31] S. I. Gelfand, M. S. Pinsker, "Coding for a channel with random parameters," Problems of 
Control and Information Theory, vol. 9, no. 1, pp. 19-31, 1980. 

[32] C. Aim, P. Doherty, P. Hayden, A. Winter, "On the distributed compression of quantum 
information," to appear in IEEE Trans. Inform. Theory, quajit-ph/0403042 

[33] A. Winter, F. Verstraete, J. Smolin, J. Oppenheim, M. Horodecki, "Entanglement of assistance 
and applications to multiuser quantum information theory, or transmitting partial (quantum) 
information - which can be negative!," talk given at QIP 2005, Cambridge, Massachusetts, 
January 2005. 

[34] M. Horodecki, J. Oppenheim, A. Winter, "Partial quantum information," Nature, vol. 436, 
no. 4, pp. 673-676, quant-ph/0505062 

[35] P. Hayden, R. Josza, D. Petz, A. Winter, "Structure of states which satisfy strong subaddi- 
tivity of quantum entropy with equality", Commun. Math. Phys., 246(2):359-374, 2004. 

[36] W. Diir, J. I. Cirac, P. Horodecki, "Non-additivity of quantum capacity for multiparty com- 
munication channels," iquant-ph/ 0403068 

[37] I. Devetak, P. Shor, "The capacity of a quantum channel for simultaneous transmission of 
classical and quantum information," quan t -ph/ 0311131, 

[38] J. Yard, "Simultaneous classical-quantum capacities of quantum multiple access chan- 
nels," Ph.D. dissertation. Electrical Engineering Dept. , Stanford University, March 2005, 
|quaiit-ph/05060"50| 

[39] J. Yard, I. Devetak, P. Hayden, "Capacity theorems for quantum multiple access channels," to 
appear in Proc. IEEE Intern. Sympos. Inform. Theory, Adelaide, Australia, September 2005, 
,cs. IT/0508031 

[40] I. Devetak, J. Yard, "Quantum channels with side information," in preparation. 

[41] C. Bennett, D. DiVincenzo, J. Smolin, "Capacities of quantum erasure channels," Phys. Rev. 
Lett, vol. 78, pp. 3217-3220, April 1997. 

[42] A. Wyner, J. Ziv, "The rate-distortion function for source decoding with side-information at 
the decoder," IEEE Trans. Inform. Theory., vol. 22, no. 1, January 1976. 

[43] M. Salehi, "Cardinality bounds on auxilliary random variables in multiple-user theory via 
the method of Ahlswede and Korner," Technical Report No. 33, Dept. of Statistics, Stanford 
University, August 1978. 

[44] H. G. Eggleston, "Convexity," Cambridge University Press, N.Y. 1963. 



38 



